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Qualcomm? HW Accelerated 3D: Adreno 


* Moving content-quality forward requires hardware acceleration 
— Upto 1024x/768 screen-resolution by mid-2009 
— Blend effects and composition of 3D with other media types 


— Multiple texture support with combiners 


* Qualcomm is enabling Hardware 3D on all its chipset tiers 
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Adreno Product Family 





Gear 
- OpenGL-ES 1.0 
= ums 
* Adreno 100 and Adreno 110 aula A 


* Commercial for 3 years, high volumes 





* Entry-level hardware acceleration 


- OpenGL-ES 1.0+/OpenGL-ES 1.1 
- Adreno 120 and Adreno 130 
- Commercial now in Asia, entering US by July 


* High performance fixed function pipeline with texture combiners and 
matrix palette extensions 


- OpenGL-ES 2.0 
- ÃAdreno 200, Adreno 210 and Adreno 220 
* (Commercial end of this year 


* High performance, flexible shader pipeline 
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Adreno Graphics Platform 








So many devices 
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Challenge for developers 


Current HW accelerated 3D Content 


À - Main SKU is Software 


7 
4 * HWis treated as an incremental feature: e.g. bilinear filtering, 
marginally higher-res textures 


s º 


Engines that do support HW do so with orthogonal render paths 





Market is shifting, will you be able to make the transition from 
incremental hardware support, to full support? 


va 


(5) Problems 
* Its an embedded device, you get the best performance from coding 


G 


to the hardware 





- But: HW manufacturers dont make it easy e 
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Architecture of one platform... 





Adreno 100, 110 


A | (App) (Vertex T&L) (Rasterization) (Frame Swaps) 


Adreno 100/110 
GPU 








Another platform... 
Adreno 130 


Adreno 130 


External TalsidaEl 
N/[ Toa OTA W/[ Toa OTA 





Qualcomm's Adreno Tools 





Adreno tools 














7% ê * Lowering cost of tuning content for HW 
e 
e, / * Allthe tools necessary to create cutting-edge SD content for Adreno platforms 
Create Content e 
* A Pã AD Optimize 
"O =D | Seometya 
| sam dá Textures 
ã Code 
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Tune Performance / VN 
=== Deploy Simulate 
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Adreno Tool Packages 


z« Developed alongside the hardware: brought up alongside drivers, and system 


| | o Dado 
OX Engine” SDK FROMSOFTWARE  9/.... Wºhrc 
“Maya, 3DS Max Exporters e Ep 
“Particle System Editor Ore E DEAROSYS Vivara senas contiai 
“QStrip Triangle Stripper . | | 
“QXTextureBuilder texture optimizer KONAMI CAPGSM VE 


“Full rendering and animation engine 





rem 127 
«Full Source Code TÁITO VA Tectoy PSnmsunad 
sesa +ê» HUDSON 








Tune 





“| Adreno Profiler 
“On-Device profiling 
“Directed analysis 


Pg a (E “Hardware metric access 
O) Elsa “API Traces 


“Powerful debugging features 
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Performance optimization using Adreno Tools 





First Prototype Woes 








'After porting, Em getting 5 FPS?" 
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Graphics Pipeline 
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Geomet RR ss | Fragment 
E É = E q Rasterizer a 
fertex ERC RE tion PTOCRESM dijo] 


E Adreno 100/110 : 
QDSP-4 a! Disp Ctrl 
a vexture 
Vertices Adreno 130 |" Disp Ctrl 
GPU v2 











CPU 
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Driver Instrumentation 
E Adreno Profiler'M PC Client 


































































































































































































































































































































































































































































































* Graphics driver is extensively instrumented 
* No application changes required 


e Available on commercial devices 
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Case Study: Neocore Optimization 





- Platform: Q3Dimension R4.0 
ê (MSM7201) 
) 


* Initial performance: 5-10fps 


* - After optimization: 30fps 
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Graphics Pipeline 
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* . 


Geometry Geometry 





cel [o (o Processor 


Vertices 
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Processor 


Texture 


storage + Pixels 
ali 
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QXTextureBuilder Tool 





— OXTextureBuilder - Cv bia -TeamicontentisamplesiMSM 7500 ÓGLES qcom sd... JOE 
Q File Output Format advanced vigm About, 








* Easily converted all textures to mipmapped + ATITC 
— Memory footprint reduced by 75% 
* Huge GPU caching benefit 
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QStrip Tool 





. 





*- Converted all meshes from discrete triangles to triangle strips 
- Also enabled frustum culling in QX Engine 
= 


/* compute strip */ 


Qstrip* pStripify = QstripCreate(); 





ushort* dStrip = QstripComputeStrips (pStripify, indexnum, indexdata); 
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ulong nStripLen = QstripGetStripLength (pStripifvy); 
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HW and system-level real-time performance metrics 
* Powerful frame analysis and debugging features 


* Supports current and future Qualcomm Adreno platforms, 
including upcoming OpenGLES v2-based cores 


* Available Today on commercial devices 





QUA LCOMMº 20 e 








q“ 1; ã . 
a! * E z k E 
o] 4 = a Fa 4 
É * à 
a m a. É + 


DBRILW 2006 CONFERPINÇCE 


Thank You! 
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7 W/ 
“4 ro Join Qualcomm's Adreno 
Developer Program, simply email: 












eo Tim Leland 
tlelandOqualcomm.com 








Better Ecosystem 
increases revenue for 
all partners 











Layered Textures 
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QX Engine 1.2 Features for Adreno 120/130 


Specular Mapping 








QXResourceMgr 











st be loaded w 


Models mu ith ResourceMgr configured to 
QX RESOURCE USAGEFLAG RETAIN ACCESSIBLE GEOMETRY 





QXModel 
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QXLightMgr 




















Choose Load Balancing Mode and Submit to LightMgr 


New APIs and utilities 


ss 


5 


Fê 


Wr. 
» 


Case Study- Step 1 


Neocore Build 1 
Metrics graph: 11 FPS 
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Bottleneck is in the texture fetch stage 
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Unoptimized 
Textures 
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Forcing 1x1 Textures jumps performance to 22 FPS 
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Profiler statistics: Statistics show textures are not ideal 
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Case Study- Step 2 


Neocore Build 2 





ú — Used QX Texture Converter to create ATI TC, Mipmapped textures 
| — Metrics graph: 20 FPS 


— We have moved the bottleneck, it is no longer in the texture pipeline 
— Statistics gathering hints that triangle-strips are not being used 


— Pulling back the camera shows unnecessary off-camera rendering 
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o — Our bottleneck is in the front-end: too many unnecessary polys + not optimized 








Off-camera geomet 
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Unoptimized 
Triangles 
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Case Study- From 11 to 30 FPS in 3 steps 





Neocore Build 3 
ê — Used QStrip to generate triangle strips, added QX Engine frustum culling 
4d | — Metrics graph: 30 FPS 


— We're done 
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Culled geometry 
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Optimized Triangles EEE 
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Optimized Textures 
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